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Abstract 

Objective To determine whether there is sufficient evidence to conclude 
that exercise interventions are more effective than no exercise control 
and to compare the effectiveness of different exercise interventions in 
relieving pain and improving function in patients with lower limb 
osteoarthritis. 

Data sources Nine electronic databases searched from inception to 
March 2012. 

Study selection Randomised controlled trials comparing exercise 
interventions with each other or with no exercise control for adults with 
knee or hip osteoarthritis. 

Data extraction Two reviewers evaluated eligibility and methodological 
quality. Main outcomes extracted were pain intensity and limitation of 
function. Trial sequential analysis was used to investigate reliability and 
conclusiveness of available evidence for exercise interventions. Bayesian 
network meta-analysis was used to combine both direct (within trial) and 
indirect (between trial) evidence on treatment effectiveness. 

Results 60 trials (44 knee, two hip, 14 mixed) covering 12 exercise 
interventions and with 8218 patients met inclusion criteria. Sequential 
analysis showed that as of 2002 sufficient evidence had been accrued 
to show significant benefit of exercise interventions over no exercise 
control. For pain relief, strengthening, flexibility plus strengthening, 
flexibility plus strengthening plus aerobic, aquatic strengthening, and 



aquatic strengthening plus flexibility, exercises were significantly more 
effective than no exercise control. A combined intervention of 
strengthening, flexibility, and aerobic exercise was also significantly 
more effective than no exercise control for improving limitation in function 
(standardised mean difference -0.63, 95% credible interval -1 .1 6 to 
-0.10). 

Conclusions As of 2002 sufficient evidence had accumulated to show 
significant benefit of exercise over no exercise in patients with 
osteoarthritis, and further trials are unlikely to overturn this result. An 
approach combining exercises to increase strength, flexibility, and aerobic 
capacity is likely to be most effective in the management of lower limb 
osteoarthritis. The evidence is largely from trials in patients with knee 
osteoarthritis. 

Protocol registration PROSPERO (www.crd.york.ac.uk/prospero/) No 
CRD4201 2002267. 

Introduction 

Osteoarthritis is the most common form of arthritis and one of 
the leading causes of pain and disability worldwide. 1 The 
lifetime prevalence of symptomatic hip osteoarthritis is 
estimated at 25. 3%, 2 while that of knee osteoarthritis is even 
higher at 44. 7%. 3 The economic burden of osteoarthritis is 
substantial and consists of both direct costs (such as drugs, 
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hospital care) and indirect costs (such as loss in productivity). 4 5 
Between 1999 and 2000 in the United Kingdom, about 36 
million working days were lost because of osteoarthritis alone, 
leading to an estimated £3.2bn (€3.8bn, $5. lbn) in productivity 
losses, with the total cost to the UK economy being estimated 
at 1% of gross national product per year. 1 In the United States, 
the estimated incremental cost of medical care expenditures and 
earnings losses for people with arthritis or other rheumatological 
conditions was $128 billion (£80.5bn, €95.8bn) in 2003." In 
Canada, time lost from employment and leisure by people with 
disabling hip or knee osteoarthritis and their unpaid caregivers 
has been estimated at $C12 200 (£7400, €8800, US$1 1 800) 
per person per year. 7 

The main goal of treatment for patients with osteoarthritis is to 
relieve the common symptoms of joint pain and improve 
everyday physical function. 8 Current international guidelines 
recommend therapeutic exercise (land or water based) as 
"core" 1 1,12 and effective management, given its beneficial 
effects, 8 l3 ~ 16 ease of application, few adverse effects, and 
relatively low costs. Regular exercise can reduce physical 
impairments and improve participation in social, domestic, 
occupational, and recreational activities. 14 Additional benefits 
of exercise include improvements in mobility, risk of falls, body 
weight, mental health, and metabolic abnormalities. 14 Recent 
guidance recommends both strengthening and aerobic exercise, 1 
but there are multiple other approaches to exercise such as 
stretching/flexibility, endurance training, aquatic exercise, and 
increasing general physical activity. 

Several systematic reviews, including meta-analyses, have 
examined the effectiveness of exercise interventions for 
osteoarthritis. 8 13-13 17 21 Since these reviews were published, many 
new trials have been conducted. Previous reviews have primarily 
focused on pairwise comparisons of the different exercise 
interventions within individual trials. One recent systematic 
review and network meta-analysis compared the effectiveness 
of acupuncture with other relevant physical treatments for 
alleviating pain from osteoarthritis. 22 To our knowledge no 
review has yet compared different exercise approaches relative 
to each other by using network meta-analysis in which all 
interventions are compared equally by drawing on both direct 
evidence (comparing treatments within the same trial) and 
indirect evidence (comparing treatments from different trials). 
This approach deals with a critical research gap highlighted by 
recent Cochrane Reviews, 8 13 which is the need for further studies 
to help provide evidence of the optimal types of exercise 
interventions and support decision making by patients, 
clinicians, and service commissioners. In this review, we first 
examined whether the required amount of information has been 
reached to confidently conclude that exercise is more effective 
than no exercise and that future trials need no longer examine 
this question. Secondly, we conducted a comprehensive 
synthesis using network meta-analysis methods to compare the 
effectiveness of different exercise interventions for pain and 
function in patients with lower limb osteoarthritis and to support 
evidence based recommendations regarding the content of these 
exercise programmes. 

Methods 

Information sources and search strategy 

An information specialist developed the search strategy with 
input from clinicians and academics in the reviewing team. 
Exercise interventions were defined as any type of therapeutic 
exercise (land or water based), regardless of content, duration, 
frequency, or intensity. We searched Medline, Embase, 



CINAHL, AMED, HMIC (Health Management Information 
Consortium), Cochrane Database of Systematic Reviews 
Cochrane Controlled Clinical Trials (CENTRAL), DARE 
(Database of Reviews of Effectiveness), NHS EED (Economic 
Evaluations Database), and Web of Science from inception to 
March 2012. Appendix 1 shows the full search strategies for 
Medline and Embase. The Cochrane highly sensitive search 
was used to identify randomised controlled trials. No language 
restrictions were applied. The bibliographies of relevant review 
articles and selected articles were examined for additionally 
potentially relevant trials. 

Study selection 

We evaluated each identified study against the following 
predetermined selection criteria: 

• Study population: adults with an established clinical or 
radiographic diagnosis of knee or hip osteoarthritis 
according to accepted criteria 23 

• Intervention: any therapeutic exercise intervention (land 
or water based), regardless of content, duration, frequency, 
or intensity 

• Comparator: other forms of exercise or no exercise control 
group 

• Outcome measure: in agreement with international 
consensus 24 regarding the core set of outcome measures 
for clinical trials in osteoarthritis, the trials needed to 
include assessment of at least one of self reported pain and 
function 

• Study design: randomised controlled trials. Trials were 
excluded if they concerned perioperative exercise therapy 
(immediately before or after surgery) or if intervention 
groups received identical exercise interventions (that is, 
no contrast existed between the intervention groups). 

Quality assessment 

We used the Cochrane Collaboration's tool for assessing risk 
of bias for quality assessment of the included trials. 25 The trials 
were graded (unclear, high, or low risk of bias) based on 
sequence generation, allocation concealment, blinding of 
outcome assessor, incomplete outcome data, and selective 
outcome reporting. 

Study selection and data abstraction 

In pairs, six reviewers (OAU, DAvdW, JLJ, NEF, KSD, ELH) 
independently evaluated the eligibility and methodological 
quality of each trial and extracted data. Two reviewers 
independently classified the exercise interventions, based on 
three a priori defined characteristics or common aims of exercise 
interventions in osteoarthritis: to mainly improve strength, 
flexibility, or aerobic capacity/general health. Discrepancies 
were resolved by discussion. For each included trial we extracted 
details on design, sample size, population characteristics (knee 
osteoarthritis, hip osteoarthritis, or mixed), interventions (land 
based or aquatic, duration, number of supervised sessions and 
type of exercise), and outcome assessment (type of outcome 
measure and length of follow-up) on to a specifically designed 
electronic form and recorded the results of quality assessments. 

Summary measures 

The primary measure of treatment effect was the standardised 
mean difference (SMD) for pain intensity and function. This is 
the difference in means between the two groups, divided by the 
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pooled standard deviation (SD) of the measurements. A negative 
value indicates more pain relief or improved function. We 
calculated change scores from baseline because many of the 
studies were small and there were significant baseline 
differences in scores between the treatment groups.' 3 When 
effect sizes were incompletely reported we contacted the 
corresponding author or estimated the SD on the basis of the 
sample size, median, and range, as suggested by Hozo and 
colleagues 26 or on the basis of the sample size and P value. For 
trials with multiple follow-ups, we used the longest duration 
reported. When a study reported multiple pain scales, we used 
the highest on the hierarchy of pain related outcomes as 
described by Juni and colleagues. 27 When the SDs of absolute 
changes from baseline were not available from individual trials, 
we imputed them as described in detail in the Cochrane 
Handbook. 28 In brief, we assumed a correlation of r=0.5 between 
baseline and follow-up to estimate SD for change from 
baseline. 241 Using the imputed correlation coefficient values, we 
thereafter calculated SDs for the change from baseline for the 
studies with missing SDs. We classed the effect sizes into three 
categories: 0.2-<0.3 as small, 0.3-< 0.8 as medium, and >0.8 as 
large. 30 If possible we used the intention to treat population for 
analyses. We back transformed effect sizes (SMD) for pain to 
differences on a 10 cm visual analogue scale (VAS) based on 
a median pooled SD of 2.5 cm reported in large scale trials in 
osteoarthritis that used a similar visual analogue scale for pain. 31 
Similarly, we back transformed standardised mean differences 
for function to a standardised Western Ontario and McMaster 
Universities (WOMAC) disability score 32 ranging from 0 to 10 
on the basis of a median pooled SD of 2. 1 units reported in large 
scale trials. 31 

Trial sequential analysis 

We examined the reliability and conclusiveness of the available 
evidence using trial sequential analyses (TSA). 33 3S The sample 
size required for a reliable and conclusive meta-analysis is at 
least as large as that of a single optimally powered randomised 
controlled trial. Therefore, we calculated the sample size (that 
is, the heterogeneity corrected optimal information size (HOIS)) 
required to detect or reject a minimal relevant difference of two 
points on the mean pain relief and function scales in favour of 
exercise compared with no exercise control. We then used the 
HOIS to help construct Lan-DeMets sequential monitoring 
boundaries for our cumulative meta-analyses, 36 analogous to 
interim monitoring in a randomised controlled trial, to determine 
when sufficient evidence had been accrued. We conducted the 
trial sequential analyses in TSA version 0.9 37 with an intention 
to maintain an overall 1% risk of a type I error and 10% risk of 
a type II error (power of 90%). 

Data synthesis and model implementation 

We used random effect Bayesian network meta-analysis (NMA) 
to compare the relative effectiveness of different exercise 
interventions. 38 " 41 Network meta-analysis is a generalisation of 
meta-analysis methods that allows comparisons of interventions 
not compared directly within individual primary trials. We 
accounted for the correlation between treatment effects induced 
by three arm trials using the multi-arm trial code in WinBUGS. 42 
We calculated the probability of each exercise intervention being 
the most effective (first best), the second best, the third best, 
and so on, and presented the results graphically with 
rankograms. 40 Probability values were summarised and reported 
as surface under the cumulative ranking (SUCRA). 40 SUCRA 
= 1 if an exercise intervention always ranks first and 0 if it 
always ranks last. 



We estimated the consistency between direct evidence (from 
randomised controlled trials directly comparing different 
exercise interventions) and indirect evidence by calculating the 
difference of standardised mean differences for indirect versus 
direct comparisons whenever indirect estimates could be 
constructed with a single common comparator. 43 We also 
inspected heterogeneity using Higgins' I 2 measure for each 
pairwise comparison. As we expected that some comparisons 
would be based on a larger number of trials than others, we 
calculated the percentage contribution of each estimate in the 
network on the summary estimates and on the entire network. 
The results were summarised in contribution plots by using 
weighted squares representing the percentage contribution of 
each comparison. 44 

We conducted a sensitivity analysis to examine whether effect 
estimates would be influenced by the joint investigated in the 
individual trials. As most trials focused on knee osteoarthritis, 
in this sensitivity analysis we restricted the network 
meta-analysis to knee osteoarthritis trials only. The number of 
trials focusing on hip osteoarthritis, or including patients with 
hip, knee, or other lower limb joints, was too small to carry out 
a separate network meta-analysis. Finally, we used network 
meta-regression analysis to investigate whether potential 
heterogeneity could be explained by differences in year of 
publication, 43 number of supervised sessions (as a measure of 
treatment intensity), and duration of follow-up. The 
meta-regression was performed by allowing for a common 
treatment-covariate interaction for each exercise intervention 
to no exercise control in the network meta-analysis. 45 

The network meta-analyses were conducted with a Bayesian 
Markov chain Monte Carlo method and fitted in Bayesian 
software, WinBUGS. 42 Two Markov chains were run 
simultaneously with different arbitrarily chosen initial values. 
Convergence to a stable solution was checked by viewing plots 
of the sampled simulations and use of the Brooks-Gelman-Rubin 
diagnostic tool (available on request). 46 47 Convergence was 
found to be adequate after we ran 50 000 samples for both 
chains. These samples were then discarded as "burn-in" and 
posterior summaries are based on 100 000 subsequent 
simulations. All results are reported as posterior medians of 
standard mean differences with corresponding 95% credible 
intervals (Crls). Credible intervals are the Bayesian equivalent 
of classic confidence intervals. A 95% credible interval can be 
interpreted as there being a 95% probability that the parameter 
takes a value in the specified range. 

Results 

Study selection and characteristics 

The literature search yielded 3796 articles, from which we 
selected 177 full text articles for critical reading (fig 111). Of the 
177 articles, 1 15 did not meet the inclusion criteria and were 
excluded. Sixty randomised controlled trials 21 48 ""' 7 (8218 
patients) with usable outcome data met the inclusion criteria 
and were included. Appendix 2 details the characteristics of the 
included trials. The trials were published between 1989 and 
2012. The maximum length of follow-up ranged from 4 weeks 
to 79 weeks (median 15 weeks). Most of the trials were from 
the US (n=17, 28%), followed by the UK (n=9, 15%) and 
Australia (n=8, 13%). Trials recruited participants mostly from 
local communities. Most of the trials recruited patients with 
knee osteoarthritis (n=44, 73%), two investigated hip 
osteoarthritis (4%), and 14 (23%) recruited a mix of patients 
with osteoarthritis of the hip, knee, or other joints. 
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Risk of bias of included trials 

We assessed risk of bias in all included trials (see appendix 3 
for summary and graph). The generation of the allocation 
sequence was adequate in most trials (n=42, 60%). Allocation 
concealment was adequate in almost half of the trials (n=25, 
42%). Thirty one trials (52%) masked outcome assessors to 
treatment allocation. The potential risk of bias likely to be 
introduced by incomplete data was high in 10 trials (18%). The 
risk of selective reporting bias was low in most trials (n=53, 
88%). 

Trial sequential analysis 

Trial sequential analysis showed that as of 2002 enough 
evidence had been accrued to show significant benefit of 
exercise intervention over no exercise control for both pain and 
functional improvement (see appendix 4). The cumulative z 
curve for pain and function crossed the trial sequential 
monitoring boundaries, implying that there is firm evidence for 
a beneficial effect of exercise interventions over no exercise in 
people with lower limb osteoarthritis. 

Network meta-analysis of exercise 
interventions: pain outcome 

There were 13 comparisons (12 exercise interventions plus no 
exercise controls) in the network of eligible exercise intervention 
comparisons for pain (see appendix 5). Most trials included a 
no exercise control group (n=54, 90%). The commonest 
comparison was no exercise control versus strengthening 
exercise (n=12, 20%), followed by no exercise control versus 
combined exercise (flexibility plus strengthening plus aerobic 
exercise) (n=10, 17%). No comparison had undue influence or 
contribution to the effect estimates in the entire network. The 
weighted percentage contributions of each comparison were 
fairly equally distributed in the entire network of evidence. 
Analysis based on direct versus indirect comparisons showed 
no evidence of inconsistency between direct and indirect 
evidence in the network, although some comparisons were based 
on a small number of trials. Table 1 provides information 
regarding statistical heterogeneity in pairwise comparisons (I 2 )l). 
For most comparisons 95% confidence intervals for statistical 
heterogeneity were wide and included values suggesting either 
no or large heterogeneity, which reflects the small number of 
studies available for most pairwise comparisons. 

Figure 2JJ. summarises the results of the network meta-analyses 
for the outcome measure of pain for all trials (regardless of the 
joint involved). Strengthening exercise only, strengthening plus 
flexibility, combined strengthening plus flexibility plus aerobic, 
aquatic strengthening, and aquatic strengthening plus flexibility 
were significantly more effective than no exercise control (see 
appendix 6 for pairwise comparisons). The overall difference 
in pain intensity (v control) was -2.03 cm (95% credible interval 
-2.82 to - 1 .26 cm, large effect size) on a 10 cm visual analogue 
scale for strengthening only exercise, - 1 .26 cm (-2.12 to -0.40 
cm, medium effect size) for flexibility plus strengthening 
exercise, -1.74 cm (-2.60 to -0.88 cm, medium effect size) 
for flexibility plus strengthening plus aerobic, -1.87 cm (-3.56 
to -0.17, medium effect size) for aquatic strengthening, and 
-1.87 cm (-4.11 to -0.68 cm, large effect size) for aquatic 
flexibility plus strengthening exercise. In terms of the cumulative 
probability of being the overall best exercise intervention for 
pain in lower limb osteoarthritis, aquatic strengthening plus 
aerobic flexibility exercise (81%) was closely followed by 
strengthening exercise only (76%), and aquatic strengthening 
plus aerobic exercise (73%) (fig 2).JJ. 



When we limited the analysis to trials focusing on knee 
osteoarthritis, the cumulative rankings did not change much 
(table 2;|), but effect estimates tended to be larger compared 
with the overall analysis, which also included trials focusing 
on hip osteoarthritis and trials investigating osteoarthritis in any 
joint. The results of the meta-regression analyses did not show 
any significant association between exercise effects and either 
one of the three study level covariates considered (table 2) j,l. 
Comparative effect estimates (SMDs) and cumulative rankings 
of exercise interventions also did not change appreciably in 
meta-regression analyses when we adjusted for trial publication 
year, number of supervised sessions, or duration of follow-up 
(table 2). 

Network meta-analysis of exercise 
interventions: function outcomes 

For function outcomes there seemed to be no evidence of 
inconsistency between direct and indirect evidence in the 
network, although the number of trials for some comparisons 
was small. The combined intervention of strengthening plus 
flexibility plus aerobic exercise was significantly more effective 
than no exercise controls (appendix 6). The overall difference 
in function (versus no control) was -1.32 units (95% credible 
interval -2.44 to -0.21 units, medium effect size) on a WOMAC 
disability scale ranging from 0 to 10 for the combination of 
strengthening, flexibility, and aerobic exercise. Figure 2 shows 
that this combination of three types of exercise (71%) and 
aquatic strengthening plus aerobic (71%) exercises had the 
highest probability of being the best overall treatment for 
function. [I 

Similar to the analysis of pain, when we limited the network 
meta-analysis to knee osteoarthritis trials, the cumulative 
rankings showed little change, although effect estimates were 
generally slightly larger compared with the overall analysis 
(table 31.1). The meta-regression analyses did not detect any 
significant association between exercise treatment effects and 
the three study level covariates considered (table 3). After 
adjustment for possible differences in these study level factors, 
however, combined exercise programmes including flexibility, 
strengthening, and aerobic exercise were no longer significantly 
more effective than no exercise controls. 

There were no significant differences for pairwise comparisons 
between the different types of exercise, with small to moderate 
effect sizes for both pain and function (appendix 6). Figure 3 
shows the scatterplot of cumulative probabilities of being the 
most effective exercise intervention for both pain and function, 
which indicates that it is likely that interventions consisting of 
a combination of strengthening exercises with aerobic and/or 
flexibility exercises are most effective. |J 

Discussion 

When we applied trial sequential analysis to determine the 
reliability and conclusiveness of evidence for the effectiveness 
of exercise interventions over no exercise control, it is clear that 
as of the year 2002 there was firm evidence of the effectiveness 
of exercise interventions for pain relief and improvement in 
function in patients with lower limb osteoarthritis. The totality 
of evidence, although largely based on trials in knee 
osteoarthritis shows that further trials of exercise versus no 
exercise are unlikely to overturn this positive result. 

Furthermore, the network meta-analysis showed that there is 
no significant difference in effect estimates between different 
types of exercise interventions, which could be partly explained 
by the small number of studies for several pairwise comparisons. 
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However, our data support better results for some exercise 
interventions. Based on relative effect estimates and cumulative 
ranking, interventions combining strengthening with flexibility 
and aerobic exercise (either land or water based) seemed to be 
the most effective intervention when we considered measures 
of both pain and function in lower limb osteoarthritis and was 
the only exercise intervention significantly more effective than 
no exercise control for both pain and function. The cumulative 
ranking obtained by the network meta-analysis cannot be 
considered as conclusive because it was impaired by the lack 
of significant differences among the exercise interventions. For 
example, aquatic flexibility plus strengthening exercises occupy 
the first rank position (81%) for pain but did not show 
superiority over any of the other exercise interventions. The 
lack of significant differences between different types of exercise 
interventions, however, might be because of the small number 
of studies contributing evidence to several comparisons in the 
network. 

Study strengths and limitations 

Our study has several strengths and limitations. We performed 
a comprehensive search of several databases and sources to 
identify eligible trials, with no language restriction. We reduced 
potential bias in the conduct of this review by having pairs of 
reviewers independently scan through the search output, extract 
data, classify interventions, and assess the methodological 
quality of each trial. Though our review uniquely aimed to 
determine if and when sufficient evidence for exercise 
interventions had been accrued, repeated meta-analysis with 
accumulating trial data can lead to false positive findings 
(random errors) if multiple testing is not accounted for. 34 To 
minimise random error, we calculated an information size (that 
is, required meta-analysis sample size) as large as that of an 
adequately powered trial and applied trial sequential analysis 
to the meta-analysis. 13 " 35 The results of trial sequential analysis 
can therefore also support decisions regarding the need for and 
feasibility of future trials. 1 " 8 

To our knowledge, this is the first network meta-analysis of 
exercise for lower limb osteoarthritis. It shows the usefulness 
of network meta-analysis for comparing different exercise 
interventions and for evaluating the relative effectiveness of 
each. Network meta-analysis is particularly relevant to this, 
given that exercise is a complex and multifaceted intervention 
and the number of trials comparing the same exercise 
interventions is small. Results are also likely to be more useful 
to policy makers, service commissioners, and care providers 
when they are making choices between multiple alternatives 
than results from multiple separate pairwise meta-analyses.'™ 
Another important strength of the network meta-analysis is that 
it treats all comparators as separate treatments while gaining 
statistical power from including all available data.'™ The 
Bayesian approach has the additional advantage of being able 
to provide probability statements for which exercise intervention 
is the most effective, even when standard methods might 
determine no significant difference between them. In addition, 
we calculated alternative rankings (second, third best, etc), 
because in some cases the best exercise intervention might be 
unavailable, more costly, or contraindicated in some patients. 

Our main aim was to explore the relative effectiveness of three 
main components of exercise interventions: increasing strength, 
flexibility, or endurance. We realise that this approach does not 
allow an analysis of more specific or detailed components of 
exercise interventions investigated in a small number of trials, 
such as t'ai chi, which was classified as "predominantly strength 
and flexibility." We considered adding categories (such as 



balance or relaxation) or investigating t'ai chi separately, but 
this further complicated the analysis and strongly reduced the 
statistical power of the network meta-analysis. 

As with most meta-analyses on exercise for osteoarthritis, our 
network meta-analysis could be criticised for including trials 
with varying length of follow-up periods. The different length 
of follow-up periods might have introduced some heterogeneity 
into the network meta-analysis. Methods that allow for repeated 
follow-up measurements have been reported for pairwise 
meta-analyses. The Cochrane handbook 28 makes several 
recommendations for the meta-analysis of trials that report 
multiple time points, including the use of individual patient data 
meta-analysis and assessment of one particular time point. 
Methods that allow for inclusion of all time points in a network 
meta-analysis, however, are still under development. The choice 
of method will depend on the research question of interest and 
the data available from the included studies. None of these 
methods is free from biases. We explored the influence of 
differences in the duration of follow-up using meta-regression 
analysis, which did not result in significant or substantial 
differences in effect estimates. We could also have explored the 
potential influence of characteristics of the study population 
(such as mean age, severity of osteoarthritis), but 
meta-regression has low power in detecting patient level 
covariate effects when average study level covariate values are 
used."""" 2 Similarly, it is prone to aggregation bias, a problem 
that can be avoided only with individual patient data 
meta-analysis."""" 2 A final limitation is that we focused our 
review on clinical effectiveness only; we did not attempt to 
assess the cost effectiveness of preferentially using a particular 
exercise intervention. 

Comparison with other studies 

Several systematic reviews (including three recent Cochrane 
reviews) have concluded that exercise is beneficial for patients 
with osteoarthritis of the lower limb. 8 13 21 Fransen and colleagues 
included 32 trials comparing any exercise intervention with a 
control group for knee osteoarthritis and showed that land based 
therapeutic exercise had significant benefits in terms of pain 
(SMD 0.40, 95% confidence interval 0.30 to 0.50) and function 
(0.37, 0.25 to 0.49). 13 The authors found no significant 
differences in the magnitude of treatment effects between 
difference types of exercise interventions. In another review, 
Fransen and colleagues included five trials and found a small 
treatment effect for pain but no benefit in terms of function for 
hip osteoarthritis. 8 They cautioned that the results of the 
meta-analysis should be considered inconclusive because of the 
small number of trials and marked heterogeneity. 8 Battels and 
colleagues included six trials investigating aquatic exercise 
interventions and found a small to moderate effect on function 
(SMD 0.26, 95% confidence interval 0.1 1 to 0.42) and large 
effect on pain (0.86, 0.25 to 1.47) for combined hip and knee 
osteoarthritis. 15 The authors emphasised that caution is needed 
in recommending aquatic exercise interventions because of the 
limited number of trials in the meta-analysis and suggested that 
trials with clearly defined patient groups with long term 
outcomes are needed to further test aquatic exercise. 15 Contrary 
to previous meta-analyses that concluded no clear superiority 
of one form of exercise over another in improving pain and 
function, 13 14 18 our study provides new, though tentative, 
evidence about the most effective exercise interventions for 
patients with lower limb osteoarthritis. It should be noted that 
most trials included in this meta-analysis focused on participants 
with knee osteoarthritis, and the results are therefore more 
applicable to such patients. The sensitivity analysis, which 
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included only trials in knee osteoarthritis, showed slightly larger 
effect estimates compared with the overall analysis, regardless 
of the type of exercise. Once further evidence has accumulated 
about the role of exercise in hip osteoarthritis, a similar network 
meta-analysis might provide insight into the relative 
effectiveness of different types of exercise interventions in such 
patients. 

Implications for practice 

We believe our findings are both clinically and scientifically 
important. They suggest that some exercise interventions are 
likely to be more effective than others in helping achieve 
improvements in pain and function in patients with lower limb 
osteoarthritis. Exercise programmes that combine strengthening 
exercise with exercise aimed at increasing flexibility and aerobic 
capacity seem to be the "best" exercise option that clinicians 
can offer to patients. Current guidelines for osteoarthritis, such 
as those in the UK,' report there is limited evidence for the 
benefits of one type of exercise over another and recommend 
both strengthening and aerobic exercise as "core" treatment. 
Osteoarthritis Research and Society International (OARSI) also 
recommended that osteoarthritis patients should be encouraged 
to undertake regular aerobic, muscle strengthening, and range 
of motion exercises.' 2 113 The results of our study provide 
evidence to support this recommendation. 

Implications for research 

One paper cannot deal with all questions pertinent to the 
evaluation of exercise for lower limb osteoarthritis. With respect 
to further empirical trials, research is needed to better identify 
how best to support patients with lower limb osteoarthritis to 
adhere to exercise and physical activity, which patients with 
lower limb osteoarthritis benefit most from exercise, and 
whether some modes of delivery and support of exercise lead 
to better outcomes. Additional questions include determining 
the long term clinical and cost effectiveness of exercise 
interventions for lower limb osteoarthritis and how best to ensure 
that the beneficial short term effects of exercise are maintained 
over time. 
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What is already known on this topic 

Current international guidelines recommend therapeutic exercise (land or water based) as "core" and effective management of osteoarthritis, 
given existing evidence regarding the beneficial effects of exercise on pain and function, ease of application, few adverse effects, and 
relatively low costs 

What this study adds 

As of the year 2002, there was enough evidence to show significant benefit of exercise interventions versus no exercise in lower limb 
osteoarthritis, indicating that further trials of exercise versus no exercise are unlikely to overturn this positive result 

Evidence from the network meta-analysis, largely based on studies in knee osteoarthritis, indicates that a programme combining flexibility, 
strengthening, and aerobic exercise is most likely to improve outcomes of pain and function 
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Tables 



Table 1 1 Statistical heterogeneity (I 2 ) for all pairwise comparisons in studies of exercise for treatment of lower limb osteoarthritis 





Pain relief 


Physical function 


Pairwise comparison No of studies 


l 2 (%) (95% CI) 


No of studies 


l 2 (%) (95% CI) 


Compared with no intervention 


Flexibility 


2 


0.0 


1 


NA 


Strengthening 


16 


91.5 (87.8 to 94.1) 


16 


91.9(88.4 to 94.3) 


Aerobic 


3 


22.8 (0.0 to 92.0) 


2 


0.0 (NA) 


Flexibility+strengthening 


11 


90.6 (85.3 to 94.0) 


10 


86.1 (76.3 to 91 .8) 


Flexibility+aerobic 


3 


0.0 (0.0 to 89.6) 


3 


0.0 (0.0 to 89.6) 


Strengthening+aerobic 


3 


65.9 (0.0 to 90.2) 


3 


0.0 (0.0 to 89.6) 


Combined 


12 


81 .9 (69.5 to 89.2) 


10 


83.4 (70.9 to 90.5) 


Aquatic: 


Strengthening 


3 


0.6 (0.0 to 89.7) 


3 


10.2 (0.0 to 90.7) 


Flexibility+strengthening 


2 


79.3 (NA) 


1 


NA 


Flexibility+aerobic 


2 


72.1 (NA) 


2 


23.6 (NA) 


Strengthening+aerobic 


1 


NA 


1 


NA 


Combined 


3 


83.3 (49.3 to 94.5) 


3 


0.0 (0.0 to 89.6) 


Compared with flexibility 


Strengthening 


3 


0.0 (0.0 to 89.6) 


3 


0.0 (0.0 to 89.6) 


Compared with strengthening 


Aerobic 


1 


NA 


1 


NA 


Flexibility+strengthening 


1 


NA 


1 


NA 


Aquatic: strengthening 


1 


NA 


1 


NA 


Compared with flexibility+strengthening 


Combined 


2 


0.0 


2 


73.1 (NA) 


Aquatic: 


Strengthening 


1 


NA 


1 


NA 


Flexibility+strengthening 


2 


80.8 


2 


0.0 (NA) 


Combined 


1 


NA 


1 


NA 


Compared with flexibility+aerobic 


Aquatic: flexibility+aerobic 


1 


NA 


1 


NA 


Compared with combined 


Aquatic: 


Strengthening+aerobic 


1 


NA 


1 


NA 


Combined 


3 


33.0 (0.0 to 93.0) 


3 


0.0 (0.0 to 89.6) 


NA=not assessed. 
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Table 2| Results of network meta-analysis of pain relief, without covariates, and additionally with adjustment for follow-up period, trial 
publication year, and number of sessions, with meta-regression. Figures are standardised mean difference (SMD) with 95% credible 
intervals and probability for cumulative ranking with surface under cumulative ranking (SUCRA) 





No adjustment 




Covariates adjusted 




SMD (95% Crl) Prob (SUCRA) 


SMD (95% Crl) 


Prob (SUCRA) 


Lower limb (knee, hips or multiple joints) 


No intervention control 


Reference 


11 


Reference 


12.2 


Flexibility 


-0.66 (-1.33 to -0.00) 


61.4 


-0.62 (-1.31 to 0.07) 


61.2 


Strengthening 


-0.81 (-1.13 to -0.50) 


76.3 


-0.74 (-1.09 to -0.39) 


73.4 


Aerobic 


-0.41 (-1.13 to 0.30) 


42.3 


-0.33 (-1.08 to 0.40) 


39.3 


Flexibility+strengthening 


-0.50 (-0.85 to -0.16) 


47.6 


-0.45 (-0.84 to -0.06) 


45.7 


Flexibility+aerobic 


-0.26 (-1.00 to 0.47) 


32.3 


-0.22 (-1.00 to 0.56) 


31.7 


Strengthening+aerobic 


-0.13 (-0.88 to 0.61) 


24.2 


-0.26 (-1.07 to 0.56) 


34.4 


Combined 


-0.69 (-1.04 to -0.35) 


65.9 


-0.64 (-1 .04 to -0.26) 


64.9 


Aquatic: 


Strengthening 


-0.75 (-1.42 to -0.07) 


67.6 


-0.68 (-1.38 to 0.02) 


65.1 


Flexibility+strengthening 


-0.96 (-1.64 to -0.27) 


81.3 


-0.91 (-1.62 to -0.21) 


80.9 


Flexibility+aerobic 


-0.07 (-0.98 to 0.83) 


22.7 


0.01 (-0.95 to 0.97) 


19.8 


Strengthening+aerobic 


-0.92 (-2.08 to 0.25) 


72.9 


-0.87 (-2.05 to 0.31) 


72.2 


Combined 


-0.45 (-1.02 to 0.11) 


44.5 


-0.47 (-1.09 to 0.14) 


49.1 


Follow-up period' 






-0.001 (-0.009 to 0.007) 




Publication yeart 






-0.01 1 (-0.048 to 0.025) 




No of sessions^ 






0.003 (-0.002 to 0.009) 




Knee only 


No intervention control 


Reference 


11.9 


Reference 


12.6 


Flexibility 


-0.74 (-1.48 to -0.00) 


57.3 


-0.70 (-1.50 to 0.09) 


57.1 


Strengthening 


-0.94 (-1.31 to -0.57) 


72.8 


-0.88 (-1 .31 to -0.46) 


71.2 


Aerobic 


-0.43 (-1.22 to 0.35) 


38.4 


-0.38 (-1.22 to 0.46) 


37.2 


Flexibility+strengthening 


-0.63 (-1.13 to -0.12) 


49.0 


-0.54 (-1.12 to 0.04) 


45.6 


Flexibility+aerobic 


-0.33 (-1.17 to 0.50) 


32.7 


-0.30 (-1.22 to 0.60) 


33.1 


Strengthening+aerobic 


0.02 (-0.99 to 1 .02) 


17.5 


-0.16 (-1.42 to 1.11) 


28.9 


Combined 


-0.79 (-1.21 to -0.38) 


61.5 


-0.74 (-1 .23 to -0.25) 


60.3 


Aquatic: 


Strengthening 










Flexibility+strengthening 


-1.25 (-2.39 to -0.11) 


81.0 


-1.17 (-2.38 to 0.04) 


78.6 


Flexibility+aerobic 


-0.42 (-1 .76 to 0.92) 


40.3 


-0.39 (-1.80 to 1.03) 


40.3 


Strengthening+aerobic 


-0.97 (-2.25 to 0.32) 


67.0 


-0.92 (-2.26 to 0.43) 


66.0 


Combined 


-1.04 (-2.33 to 0.24) 


70.6 


-0.99 (-2.33 to 0.35) 


69.2 


Follow-up period' 






-0.00 (-0.010 to 0.010) 




Publication year" 






-0.01 1 (-0.054 to 0.033) 




No of sessions" 






0.002 (-0.005 to 0.010) 





*Follow-up period (in weeks) centred at mean value (28.7 weeks). 

tPublication year centred at 2000. 

$No of sessions centred at mean value (29.8 sessions). 
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Table 3| Results of network meta-analysis Improvement of function, without covariates, and additionally with adjustment for follow-up 
period, trial publication year, and number of sessions, with meta-regression. Figures are standardised mean difference (SMD) with 95% 
credible intervals and probability for cumulative ranking with surface under cumulative ranking (SUCRA) 





No adjustment 


Covariates adjusted 




SMD (95% Crl) 


Prob (SUCRA) 


SMD (95% Crl) 


Prob (SUCRA) 


Lower limb (knee, hips or multiple joints) 


No intervention control 


Reference 


22.7 


Reference 


23.5 


Flexibility 


-0.17 (-1.26 to 0.93) 


40.1 


-0.1 4 (-1.26 to 0.97) 


38.8 


Strengthening 


-0.37 (-0.84 to 0.09) 


52.8 


-0.36 (-0.87 to 0.14) 


52.1 


Aerobic 


-0.30 (-1.53 to 0.92) 


47.9 


-0.46 (-1.71 to 0.80) 


55.9 


Flexibility+strengthening 


-0.40 (-0.92 to 0.12) 


54.6 


-0.38 (-0.98 to 0.22) 


52.9 


Flexibility+aerobic 


-0.18 (-1.24 to 0.89) 


40.7 


-0.27 (-1.40 to 0.86) 


46.0 


Strengthening+aerobic 


-0.1 7 -1.25 to 0.91) 


40.0 


-0.26 (-1.46 to 0.94) 


45.6 


Combined 


-0.63 (-1.16 to -0.10) 


70.9 


-0.57 (-1.1 7 to 0.03) 


66.8 


Aquatic: 


Strengthening 


-0.43 (-1.42 to 0.56) 


55.2 


-0.34 (-1.36 to 0.68) 


49.7 


Flexibility+strengthening 


-0.61 (-1 .75 to 0.52) 


64.9 


-0.57 (-1.73 to 0.58) 


62.7 


Flexibility+aerobic 


0.07 (-1.23 to 1.36) 


29.5 


0.13 (-1.24 to 1.50) 


27.1 


Strengthening+aerobic 


-0.86 (-2.52 to 0.79) 


71.4 


-0.79 (-2.47 to 0.90) 


68.5 


Combined 


-0.49 (-1.32 to 0.33) 


59.5 


-0.51 (-1.40 to 0.39) 


60.3 


Follow-up period' 






0.009 (-0.003 to 0.021) 




Publication yeart 






-0.001 (-0.056 to 0.054) 




No of sessions^: 






-0.000 (-0.009 to 0.008) 




Knee only osteoarthritis 


No intervention control 


Reference 


23 


Reference 


22.4 


Flexibility 


-0.22 (-1.36 to 0.92) 


39.8 


-0.23 (-1.39 to 0.93) 


38.9 


Strengthening 


-0.43 (-0.94 to 0.07) 


51.6 


-0.46 (-1.02 to 0.10) 


51.5 


Aerobic 


-0.32 (-1.58 to 0.94) 


45.4 


-0.54 (-1.83 to 0.75) 


54.3 


Flexibility+strengthening 


-0.65 (-1.38 to 0.07) 


63.5 


-0.73 (-1.56 to 0.11) 


65.6 


Flexibility+aerobic 


-0.21 (-1.33 to 0.91) 


39.6 


-0.32 (-1.51 to 0.87) 


43.9 


Strengthening+aerobic 


-0.08 (-1.43 to 1.28) 


34.5 


-0.08 (-1.88 to 1.72) 


36.1 


Combined 


-0.70 (-1.32 to -0.09) 


67.1 


-0.60 (-1.28 to 0.10) 


58.9 


Aquatic: 


Strengthening 










Flexibility+strengthening 


-0.95 (-2.51 to 0.61) 


71.3 


-1.02 (-2.63 to 0.59) 


72.7 


Flexibility+aerobic 


-0.10 (-1.89 to 1.68) 


37.6 


-0.12 (-1.94 to 1.71) 


37.4 


Strengthening+aerobic 


-0.90 (-2.61 to 0.81) 


68.2 


-0.80 (-2.55 to 0.94) 


63.6 


Combined 


-0.64 (-2.35 to 1.08) 


58.4 


-0.57 (-2.32 to 1.17) 


54.7 


Follow-up period' 






0.012 (-0.002 to 0.026) 




Publication yeart 






0.006 (-0.055 to 0.067) 




No of sessions:): 






-0.003 (-0.015 to 0.009) 





*Follow-up period (in weeks) centred at mean value (27.8 weeks). 

tPublication year centred at 2000. 

JNo of sessions centred at mean value (31 .4 sessions). 
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Figures 



Records identified through database searching (n=6415) 
\ 

Records after duplicates removed (n=3796) 
J 

Records screened (n=3796) 



— — Records excluded based on title or abstract (n=3619) 

Full text articles assessed for eligibility (n=177) 

Full text articles excluded based on full text (n=115): 
No relevant population (n=12) 
No relevant intervention (n-18) 
No relevant control (n=23) 
— »- No relevant outcomes (n=14) 

Not a randomised controlled trial (n=22) 
Protocol (n=10) 
Review article (n=9) 
Commentary (n=7) 



F 



Studies included in qualitative synthesis (n=62) 

Duplicate publications (n=2) 

Studies included in quantitative synthesis 
(network meta-analysis) (n=60) 



Fig 1 Study selection of trials examining exercise in treatment of lower limb osteoarthritis 
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Exercise regimen Prob(SUCRA) 
Pain 

No intervention control 
Flexibility 
Strengthening 
Aerobic 

Flexibility and strengthening 
Flexibility and aerobic 
Strengthening and aerobic 
Combined 
Aquatic: 
Strengthening 

Flexibility and strengthening 

Flexibility and aerobic 

Strengtheningand aerobic 

Combined 
Physical function 
No intervention control 
Flexibility 
Strengthening 
Aerobic 

Flexibility and strengthening 
Flexibility and aerobic 
Strengthening and aerobic 
Combined 
Aquatic: 
Strengthening 

Flexibility and strengthening 
Flexibility and aerobic 
Strengtheningand aerobic 
Combined 



SMD (92% Crl) 




SMD (92% Crl) 



Reference 
■1.33 to -O.OO) 
1.13 to -0.50) 
-1.13 to 0.30) 
0.85 to -0.16) 
-1.00 to 0.47) 
-0.88 to 0.61) 
1.04 to -0.35) 

1.42 to -0.07) 
1.64 to -0.28) 
-0.98 to 0.83) 
-2.08 to 0.25) 
-1.02 to 0.11) 

Reference 
■1.26 to 0.93) 
■0.84 to 0.09) 
■1.53 to 0.92) 
•0.92 to 0.12) 
■1.24 to 0.89) 
-1.25 to 0.91) 
1.16 to -0.10) 

■1.42 to 0.56) 
-1.75 to 0.52) 
-1.23 to 1.36) 
■2.52 to 0.79) 
-1.32 to 0.35) 



Favours 

exercise no 
regimen control 

Fig 2 Forest plots for network meta-analysis for pain and physical function outcomes with no exercise as reference group. 
SMD=standardised mean difference, SUCRA=surface under cumulative ranking. SUCRA=1 when exercise intervention is 
certain to be best (that is, always ranks first) and 0 when exercise intervention is certain to be worst 



100 

w A: Aerobic; Aq: Aquatic; C: Control (no exercise) 

0 F: Flexibility, S: Strengthening 

1 80 Aq: S+A 
i 9 • 

t Aq: F+S+A F+s+A 0 Aq: F+S 

£ * Aq:S 9 • 

= S+A W A S 

■a 40 9 • • 

| F + A 

- » Aq: F+A 



20 



08 0 20 40 60 80 100 

Ranking probability for pain (%) 

Fig 3 Scatter plot presenting ranking of exercise interventions for pain reduction and physical function based on cumulative 
probability of being most effective intervention 
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